Shortest Path Set Induced Vertex Ordering and its Application to Distributed 
Distance Optimal Multi-agent Formation Path Planning 
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Abstract — For the task of moving a group of indistinguish- 
able agents on a connected graph with unit edge lengths into 
an arbitrary goal formation, it was shown that distance optimal 
paths can be scheduled to complete with a tight convergence 
time guarantee [22], using a fully centralized algorithm. In this 
study, we show that the problem formulation in fact induces a 
more fundamental ordering of the vertices on the underlying 
graph network, which directly leads to a more intuitive schedul- 
ing algorithm that assures the same convergence time and runs 
faster. More importantly, this structure enables a distributed 
scheduling algorithm once individual paths are assigned to the 
agents, which was not possible before. The vertex ordering also 
readily extends to more general graphs - those with non-unit 
capacities and edge lengths - for which we again guarantee the 
convergence time until the desired formation is achieved. 

I. Introduction 

For the task of moving a group of n indistinguishable 
agents (or equivalently, robots or vehicles) on a connected 
graph unit length edges into an arbitrary goal formation, an 
efficient centralized algorithm in [22] schedules all agents 
from an initial formation (configuration) to a goal formation, 
along paths of which the total distance is minimal. Moreover, 
it was established that the schedule can be completed within 
n + £ — 1 time steps (I is the largest of the distances between 
all pairings of start and goal vertices), which is shown to be 
a tight bound. 

In this paper, we show that a directed acyclic graph (DAG) 
induced by the initial and goal formations admits an integral 
ordering of the vertices on the involved paths. The ordering, 
which may be used to compute the distance between any 
two vertices on a directed path of the DAG, is unique up 
to an additive constant and leads to the same convergence 
time guarantee. This more fundamental structure provides 
a smooth transition from the problem formulation to the 
solution, which is missing from the constructive proof of- 
fered in [22]. Based on this structure, once the initial agent- 
target assignment is completed, the agents, using only local 
(up to distance 2) communication among the neighbors, can 
achieve the desired formation within identical convergence 
time. To the best of our knowledge, this work provides the 
first multi-agent formation path planning algorithm that is 
both truly distance optimal and partially distributed, along 
with a tight convergence time guarantee (keep in mind that 
global distance optimality is not possible without direct 
or indirect global communication, implying that a fully 
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distributed planning algorithm is out of the question here). 
We implemented the algorithm which is accessible online 
(see Section |V). As we will see, the ordering also allows 
easy extension of the results to graphs with edges having 
arbitrary integer lengths and non-unit capacities (i.e., more 
than one agent may be traveling on the same edge at a given 
instant). This paragraph captures the main contributions of 
our study. 

When it comes to problems on formation, two subprob- 
lems come up. One of them is on the topic of formation 
control, which focuses on maintaining a formation of a group 
of vehicles; a desired formation, in these research, may be 
important for inter-vehicle communication or for maximiz- 
ing certain utility functions [4], [16], [24]. Graph theoretic 
approaches are quite popular here, probably because vehicles 
and inter-vehicle constraints can be represented naturally 
with vertices and edges of graphs. The second subproblem 
put more emphasis on how to achieve a desired formation 
(as opposed to stabilizing around a given formation) [3], 
[5], [7], [8], [11], [21], [12], [15], [19], [20], which is the 
problem we address in this paper. On research that appears 
most related to our problem, a discrete grid abstraction model 
for formation control was studied in [10]. To plan the paths, 
a three-step process was used in [10]: 1) Target assignment, 
2) Path allocation, 3) Trajectory scheduling. Although it was 
shown that the process always terminates, no characterization 
of solution complexity was offered. In contrast, we provide 
efficient algorithms for solving a strictly more general class 
of problems with optimality assurance. 

Generalizing the notion of formation to include multiple 
agents trying to agree on some common goal leads to 
the problem of consensus/rendezvous. This more general 
problem has remained a central research topic in control 
theory and robotics; see, e.g., [1], [2], [6], [13], [14], [16], 
[17], [18], [23], to list a few. An early account of the ren- 
dezvous problem, as a form of formation control, appeared in 
[1], in which algorithmic solutions are provided for agents 
with limited range sensing capabilities. An 71-dimensional 
rendezvous problem was approached via proximity graphs in 
[2]. For the consensus problem it is shown that averaging the 
behavior of close neighbors causes all agents to converge to 
the same behavior eventually [6]. We point out that, although 
this paper works with disjoint initial and goal vertex sets 
of n distinct elements each, the presented results can be 
easily generalized to any number of goal vertices between 1 
and n, thus covering additional problems such as multi-agent 
rendezvous. 

The rest of the paper is organized as follows. Section 



HI1 provides the problem formulation, an example, and its 
solution. Section [HI] constructively proves the existence of 
the afore mentioned vertex ordering on the induced DAG. 
Section [V] shows an application of the vertex ordering in 
scheduling a set of distance optimal paths for the agents. 
Section IVl then shows the scheduling algorithm can be easily 
turned into a distributed one. We generalize the graph to 
have integer edge lengths and capacities in Section [VI] and 
conclude in Section Wu\ 



II. Formation Path Planning on Graphs 

Let G— (V,E) be a connected, undirected, simple graph, 
in which V = {v,} is its vertex set and E = {(v,-,V/)} is its 
edge set. Let A = {a\,...,a n } ben agents that move with unit 
speeds along the edges of G, with initial and goal vertices on 
G specified by the injective maps xi, xq : A — > V , respectively. 
The set A is effectively an index set. For convenience, V, E 
also denote cardinalities of the sets V,E, respectively. Let <7 
be a permutation that acts on the elements of xq, (ctoxg) 
is a map that defines a possible goal vertex assignment (a 
target formation). 

A scheduled path is a map p\ : Z + — > V, in which Z + : = 
NU{0}. Intuitively, the domain of the paths is discrete time 
steps. A scheduled path pt is feasible for a single agent a, if 
it satisfies the following properties: 1) pi(0) =x/(a,). 2) For 
each i, there exists a smallest k m [ n £ Z + such that pi(k m i n ) = 
(o oxc)(ai) for some fixed a (i.e., same a for all 1 < i< n). 
That is, the end point of the path p\ is some unique goal 
vertex. 3) For any k > k m i n , Pi{k) = (<7 oxo)(ai). 4) For any 
< k < k min , (pi(k), Pi (k+l)) £ E or Pi {k) = Pi {k+\). We 
say that two paths pi,pj are in collision if there exists k £ Z + 
such that pi(k) = pj(k) (meet, or collision on a vertex) or 
(j>i(k),pi(k+ 1)) = (pj(k+l),pj(k)) (head-on, or collision 
on an edge). If p(k) = p(k+ 1), the agent stays at vertex 
p(k) between the time steps k and k+1. 

Problem 1 (Formation Control on Graphs) Given a 4- 
tuple (G,A,x/,xg), find a set of paths P = {pi, . . . ,p„} and a 
fixed (7 such that pi's are feasible paths for respective agents 
at's for this a and no two paths Pi,Pj are in collision. 

Note that in the definition above, we have implicitly 
assumed that edges of G have unit lengths and capacities. 
That is, an edge takes unit time for an agent to cross and no 
two agents can be on an edge at the same time. This implicit 
assumption is used throughout Section iHlMVl and relaxed in 
Section LYJ 

To familiarize readers with the problem and its solution, 
look at the example in Fig. Q] The underlying graph G 
is a 6x7 grid with holes. Assigning the top left cor- 
ner coordinates (0,0) and bottom right coordinates (6,5), 
xi(A) = {(0,z-l)},£ G (A) = {(6,i-l)},l <i<6. That is, 
we want to move the agents from left to right. A solution 
to this problem that is distance optimal is given in Table H] 
corresponding to a schedule of the multi-colored paths in Fig. 
Q] Here, distance optimality seeks to minimize the total path 
lengths of all agents. Each main entry of the table designates 




Fig. 1 . A 6 x 7 grid with some vertices removed. The colored discs on 
the left represent the initial formation and the gray discs represent the goal 
formation. The colored paths represent the paths (not yet scheduled to avoid 
collision). 



the coordinates of the vertex an agent should be staying at 
the given time step. 

TABLE I 



Time Step 
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2 


3 


4 


5 


6 


7 


8 


l 


0,0 


1,0 


2,0 


3,0 


4,0 


5,0 


6,0 


6,1 


6,1 


2 


0,1 


0,0 


1,0 


2,0 


3,0 


4,0 


5,0 


6,0 


6,0 


3 


0,2 


1,2 


2,2 


3,2 


3,3 


4,3 


5,3 


6,3 


6,2 


4 


0,3 


1,3 


1,4 


1,4 


2,4 


3,4 


4,4 


5,4 


6,4 


5 


0,4 


1,4 


2,4 


3,4 


4,4 


5,4 


6,4 


6,5 


6,5 


6 


0,5 


1,5 


2,5 


2,4 


3,4 


4,4 


5,4 


6,4 


6,3 



III. Formation Induced Vertex Ordering 



Algorithm 1 PlanShortestPathSet 
Input: G,A,xj,xq as described in Problem [TJ 
Output: Q= {qi,...,q„} 

1: for each iij £ x;(A) do 

2: run Dijkstra's algorithm to get shortest paths qij for all 

(ui,Vj)'s such that Vj £xq(A) 
3: end for 

4: run minimum weighted bipartite matching algorithm on the 

above set of n 2 paths to get a path set Q. 
5: return Q 



Given xj and xq, it is relatively straightforward to obtain 
a unscheduled path set Q = {q\, . . . ,q n } in which is 
a sequence of vertices such that each is adjacent to the 
next in the sequence (we use Q to distinguish these paths 
from the scheduled paths, which we denote as P). Such a 
procedure, generally known as the Hungarian algorithm [9], 
is outlined in Algorithm Q] (from [22]). Let head(qj), tail(qi), 
and len(qi) denote the start vertex, end vertex, and length of 
qi, respectively. The path set Q returned from Algorithm [T] 
has the following properties: 

Property 2 For all 1 < ; <n, head(qi) Gx/(A) and tail(qi) £ 
xq(A). For any two paths qi,qj, head(qi) ^ head{qf) and 
tail(qi) ^ tail(qj). 

Property 3 Each path qi is a shortest path between head(qi) 
and tailiqi) on G. 



Property 4 The total length of the path set Q is minimal. 

Constructively guaranteed by Algorithm Q] Properties 0and 
[3] ensure that the initial and goal vertices are paired up using 
shortest paths. Property 2] requires the total length of these 
paths to be minimal. From now on, Q is always assumed to 
be a path set satisfying properties |2]|4] It is not hard to see 
that Property [4] implies the following [22]: 

Property 5 If we orient the edges of every path qj S Q from 
head{q{) to tail{qi), no two paths share a common edge 
oriented in different directions. 

With a slight abuse of notation, let V (•),£'(•) denote the 
vertex set and undirected edge set of the input argument, 
which can be either a path, qt, or a set of paths, such as Q. 
We define an intersection between two paths as a maximal 
consecutive sequence of vertices and edges common to the 
two paths. Property [5] is a special case of a more general 
structure of the path set Q [22]. 

Proposition 6 The path set Q induces a directed acyclic 
graph (DAG) structure on E(Q). 

With Proposition [6] it is then possible to bound the total 
number of time steps necessary to schedule the path set Q. 
Somewhat surprisingly, the DAG structure on Q has an even 
stronger order property that does not hold in general for 
DAGs; this is where the contribution of this paper starts. To 
state this property, some definitions are needed for describing 
relationships between paths. Recall that two paths intersect (a 
symmetric relationship) if they share some common vertices 
or edges. Two paths q,,qj are linked (again a symmetric 
relationship) if either qt,qj intersect or both qt,qj are linked 
to some q^. A cluster Q c is a set of paths such that every 
pair of paths qi,qj £ Q c are linked. A path cluster Q c is a 
maximal cluster of Q if Q c is a cluster and no other path 
1i G Q\Qc is linked to a path qj 6 Q c . For each path qt £ Q, 
define a distance value function, di'.V — > Z + , such that for 

uev( qi ), 

u = head(qi), 

dist(head(q\),u) otherwise, 

in which dist(u,v) denotes the shortest distance between u,v 
on the graph G. Distance value functions can be defined 
similarly for an arbitrary set of vertices. Given the general- 
ized definition, we say that one distance value function, d', 
respects another one, d, if d' is defined for all of cf s domain 
and for any u,v on which d is defined, 

d'{u)-d'{v)=d{u)-d(v). (2) 

In an unscheduled path set Q, for any two paths qi,qj that 
intersect, it is not hard to construct a distance value function 
that respects both qi and qj. 

Lemma 7 If a vertex u* belongs to the intersection of two 
paths qt,qj G Q, then the value function 

d,(u) ueV(qi), 
d c (u*) +dj(u) — dj(u*) uEV(qj), 



respects both di and dj. 



di(u) 



(1) 



d c {u) = 




Fig. 2. Two intersections between two paths. 

Proof. By definition, d c respects c/,. If there is a single 
intersection (or common segment) between qi and qj, then d c 
respects dj since by Property [5] qi and qj cannot have edges 
oriented differently. If not, let v* be another intersection point 
of qi,qj such that that segments of qi,qj between u*,v* differ 
(see Fig. |2j. Denote the segment between u* ,v* as (Oi,(Qj 
for qi,qj, respectively. Note that (Qj must be oriented the 
same way between u*,v* by Proposition [6] We want to show 
that di(v*) =d c (v*), or 



(4) 



di(v*) = d c {v*) = d c (u*)+dj(v*)-dj(u*) 
= di(u*)+dj(v*)-dj(u*) 
di{v*)- di(u* )=dj(v*)-dj(u*) 
len(cOi) = len(a)j) 

The last equation of (0]) holds since otherwise, for example 
len(c)j) < len((Oj), then both path should have taken 
Since v* is arbitrary, we have shown that d c respects both 



qi,qj. 



□ 



We now show that ^} extends d c to a path cluster. 



Theorem 8 Given a path cluster, Q c = {q\, . . . ,q m } C Q, 
there exists a distance value function d c '. V(Q C ) — > Z + , such 
that d c respects di for all 1 < i < m. 

Proof. Lemma [7] proves the case for any path cluster with 
no more than two paths. To complete the proof, assuming that 
given a d c that respected a sub cluster {q\, . . . ,qk-\} C Q c , 
we show that the recursively definition 



dAu) 



d c {u) 



u e V({qi,...,q k -i}), 



d c (u*)+dk(u) —dk(u*) u£V(qi(), 



(5) 



(3) 



extends d c so that it respects <4 for a path q^ that intersects 
paths in {q\, . . . ,^-1} at some vertex u*. The case is trivial if 
qit intersects paths in {q\, . . . only once or q% intersects 

a single path q,,l < i <k— 1 more than once. 

The leftover case is that q^ intersects qi,qj, 1 < i < j < 
k—1, at u* , v* , respectively (the order of i, j does not matter), 
with u*,v* being q^'s earliest two intersections with paths 
in {qi, . . . ,qk-i}. We may assume that the intersection has 
the general structure illustrated in Fig. [3] (in the figure there 
are 7 paths on the undirected cycle; there could be fewer 
or more paths). We assume a single intersection (see Fig. 
[3]) between each pair of paths since we have shown that 
multiple intersections between the same pair of paths do not 



Fig. 3. A typical multiple path intersection. 



Fig. 5. Augmented paths. The dotted paths are the old paths, replaced by 
the green ones. Note that the two sets of paths (green and dotted ones) have 
the same initial and goal vertex sets. 



affect d c values. The other possibility for the assumption to 
be violated is that some path, for example q,, takes a different 
orientation from that shown in Fig. [3] When this happens, it 
is always possible to update {<?i, • • • to a new path set 
such that fewer paths are on the undirected cycle. We show 
how to handle this exception using an example in which qi 
takes a different orientation (see Fig. @); all other cases are 
similar. 

For the case depicted in Fig. |U we update qi,q k to q'[,q' k 
(green paths in Fig. @). It is straightforward to verify that 
Properties [2]|4] are not violated. Furthermore, d c as obtained 
for readily extends to q\ via d c (u*) since the 

segment of q k before u* does not intersect {qi, . . . ,q k -\ } by 
assumption. We are left with the same general setting we 
begin with, except that there is one fewer path (gj) on the 
cycle. 




Fig. 4. A scenario in which q/ has a different orientation compared to Fig. 



Returning to the case illustrated in Fig. [3] on the undi- 
rected cycle formed by intersecting paths, there is always 
an even number of paths between u* , v* besides q k (possibly 
after applying above path switching procedure many times). 
Let this even number be 2b and the 2b paths intersect at 
Ml,... ,U2b-\ (Fig. [3] shows the case where b = 3). Also let 
u* = uq,v* = U2b- To show that © extends d c to d k , we 
need to show dk{v*) -d k (u*) =d c {v*) -d c (u*). If this is not 
true, without loss of generality, assume that d k {v*) — d k {u*) > 
d c (v*) — d c (u*). For this case, we update segments of the 
paths as shown in Fig. [5] The update gives us a net gain of 



path length 

b-l b 

—dist{u*,v*) + y ^ dist(u2j,U2j+i) — /J dist (un- \ , Uii) 

i=0 i=\ 
6-1 

= d k (u*) - d k (v*) + }Xd c (u 2 i+i) - d c {u 2 i)) 
i=0 

b 

+ }Xd c (u2i-l)-d c (U2i)) 

= d k (u*)-d k (v*)+d c (v*)-d c {u*) < 0, 

which contradicts Property 2] We conclude that (|5) indeed 
extends d c to respect d k . □ 

IV. A Simple Path Scheduling Algorithm Based 
on the Induced Vertex Ordering 

Assuming that a time optimal schedule seeks to minimize 
the time it takes the last agent to reach its goal, the following 
was established in [22]: 

Lemma 9 Distance optimality and time optimality for Prob- 
lem [2] cannot be simultaneously satisfied. 

Furthermore, let I be the largest pairwise distance between 
a member of xj(A) and a member of xq(A), 

£= max dist(w,v). (6) 

yuexi(A),vex G (A) 

It was also shown in [22] that n + 1 — 1 time steps is 
sometimes necessary to schedule a shortest path set Q. It was 
then shown that an unscheduled path set Q can be turned 
into a scheduled path set P with a maximum of n+l—\ 
time steps, providing a distance optimal schedule with a 
tight scheduling time bound. We now show that the vertex 
ordering induced by xq leads to a scheduling algorithm with 
the same guarantees on the scheduled paths' qualities. The 
new algorithm has a better running time of 0(nV\ogn) and 
is easy to understand; it is not clear though, from a first look, 
that it should provide the said convergence time guarantee. 

By Theorem [S] each maximal path cluster Q c C Q can 
be assigned a distance value function d c that respects every 
qt S Q c - Since these individual d c 's have no common domain, 
they can be combined to give a global d c (for a fixed Q). 



Assuming such a d c , which can be obtained easily using ([5]). 
Before scheduling the path set Q, we introduce a subroutine 
to handle the scenario illustrated in Fig. [6] In the figure, 
Q = {qi,qi} with head(qj) — Uj,tail(qi) = V; for i = 1,2. This 
path set cannot be scheduled as is, since q\ is in the way 



completed) is no more than £q. However, as we have 
mentioned, Algorithm|2]provides a much stronger guarantee, 
as Theorem QT| will show, we apologize for the somewhat 
long proof but it seems more appropriate to have a long 
proof in this case than to split it into lemmas. 



Fig. 6. A path set Q that cannot be scheduled without modification. 

of q2- However, as agent a\ reaches v\, we can dynamically 
switch the goals of qi,q2- Note that the path set after this 
update still satisfies Properties [2]|4] For paths quqj, denote 
this path switching subroutine switch(qi,qj). 

Algorithm 2 ScheduleShortestPaths 
Input: G,Q,d c 

Output: scheduled paths, P = {p\ , . . . ,p n } 

let pi(0) = head(qj) for all 1 < i <n 
let v; = next(qi,head(qi)) for all applicable qi £ Q 
let t = 1 

while some v, is not fully scheduled do 

while some pi(t) is not set for the current t do 
pick a candidate path q, with largest d c (v,) 
if v; is not the same as any Pj(t) already assigned then 
Pi(t) = v; 

v; = next(qi,Vi) if qi is not fully scheduled 
if V; == tail(qi) and v; falls on some qj such that qj 
has yet to reach V; then 

switch(qi,qj) 
end if 
else 

Pi{t)=Pi(t-l) 
end if 
end while 
t=t + l 
end while 

return P = { Pu . . . ,p„} 
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10: 

11 
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The path scheduling subroutine is outlined in Algorithm 
IH in which the routine next{qi,v) returns the next vertex of 
path qi after vertex v. A path qi is fully scheduled if tail(qi) 
is assigned to piit) for some t. The scheduling routine never 
considers two paths qi . qj running in opposite directions since 
Property [5] excludes such cases. Essentially, the scheduling 
algorithm let all paths from Q take their respective courses 
simultaneously. Whenever two paths are competing for going 
to the same vertex, an arbitrary path is picked to go and 
the other one to stay put. With the switch^-, •) subroutine to 
guarantee that no deadlock can occur, it is straightforward to 
see that the process must converge since at each f, at least 
one agent will make progress toward its goal. That is, 

Proposition 10 Algorithm\2\terminates infinite time. 

Denote the total path length of Q as £q, then the con- 
vergence time (the time it takes for the formation to be 



Theorem 11 Algorithm\2\provides a schedule that takes at 
most n + £ — 1 time steps to complete. 

Proof. We constructively prove the theorem starting with a 
path set Q. Let Q c = {q\, ...,q m } be an arbitrary maximal 
cluster of Q, it is clear that the schedule of Q c is not affected 
by any other path Q\Q C ; hence, we only need to prove the 
claim for Q c . Moreover, we only need to prove the bound 
for the special case in which the routine switch(-, •) is never 
invoked, since we can effectively consider the "dynamic 
switching" all happen at time step t = 0. Note that it is easy 
to see, once we go through the proof, that it also holds if 
dynamic switching were performed on the fly. 

We want to schedule all agents, a\,...,a m , along 
qi,... ,q,„, respectively, starting at t — 0. Before starting, we 
create a list of numbers, H, indexed by possible d c values (as 
constructed in the proof of Theorem [8} in decreasing order. 
Since the cluster Q c is finite, H is also finite. An entry of 
this list, hj, is the number of agents whose current locations 
have a d c value of d. A list H may look like 



d 



5 4 3 2 1 -1 -2 -3 -4 
003021 2 



We note that for a fixed time step t, the only importance of 
the index d in list H is that it specifies the relative order 
of agents. In the above H, for example, /?3 = 3, as the first 
non-zero entry, means that there are 3 agents as the "front 
runners", followed by next non-zero entry h\=\, suggesting 
that there is 1 agent two steps behind. From this observation, 
we may negate the index d and at each time step t, align h\ 
with the first non-zero entry of H. The above H then becomes 

d : -1012345678 
h d : 0030120002 

We can also remove the leading (and trailing) zero entries 



d 
hd 



1 2 3 4 5 6 7 8 
3 1 2 2 



At this point, we partition the list into one or more sublists 
as follows. Starting at d = 0, we look at the sum of first k 
terms of H, 



(7) 



d=l 



If it ever happens for some k, starting at 1 , that Sk = k, we 
group these k terms of H into a sublist and work with it. We 
call these sublists contiguous sublists. Applying the partition 
procedure to H above, the first contiguous sublist, H' , is 

d : 1 2 3 4 5 6 
h'j : 3 1 2 



Now let us consider how the sublist may change after we let 
all agents start moving towards their respective goals. For 
the first group of 3 agents, at least one of them can move 
one step closer to its goal and at most two of them may not 
make any progress. The worst case happens in a situation 
illustrated in Fig. [7] When conflict like this happens, we pick 
a random agent to advance. Suppose the worst case happens, 
we update to have h' = \ ,h\ = 2. 



3 

-m- 



Fig. 7. A three-way conflict between three agents, in which case we pick 
any agent to go ahead and let the other two wait. 

For the rest of the non-zero entries of H' , if it is preceded 
by a zero entry, it means that there are no agents with d c 
values exactly one larger than this group of agents. Hence, at 
least one agent from this group of agents can make one step 
progress towards its goal. When this is applied to h' 3 , which 
has a value 1, we update the sublist entries as h' 2 — l,h' 3 — 0. 
After these two steps, the sublist H' , still being processed 
between time steps, is 

d : 1 2 3 4 5 6 
h' d : 1 2 1 2 

Observe that the next non-zero entry, h' 4 — 2, now has a 
preceding zero entry. Assume that only one agent advances, 
we update the entries to h' 3 = l,h' 4 = 1. At the end of t = 1, 
the updates give us H' as 

d : 1 2 3 4 5 6 
h' d : 1 2 1 1 1 

We can adjust d to get 



d : 1 2 3 4 5 6 
ti d : 1 2 1 1 1 

What does the entries of H' mean after each update? The 
entry h\ represents the number of agents (paths) that never 
waited for others. Similarly, the entry h\ represents the 
number of agents (paths) that never waited for more than 
i— 1 steps. Note that the leading entry is h\ = 1. Whenever 
the leading entry becomes 1, the associated agent/path can no 
longer have any conflict with any other agent/path. That is, it 
has no more interaction with the rest of the agents. We claim 
that throughout the updates, at least i agents never waited 
more than i — 1 steps, which is easily verifiable via induction 
(we omit the details due to its length and irrelevance to the 
rest of the paper). The worst case happens when H' becomes 
all l's as 

d : 1 2 3 4 5 6 



h' 



111111 



In any case, by the above claim, agents from a contiguous 
sublist cannot "spill over" to the next contiguous list. If we 
apply what we have done to every contiguous sublist, the 



claim that at least i agents never waited more than i — 1 
steps holds for all agents moving on the cluster Q c . Since 
no agent travels a length more than I, We have proved the 
claim of the theorem. □ 



V. A Distributed Scheduling Algorithm 

Looking closely at the constructive proof of Theorem Q~T] 
it is not hard to observe the following: 1. Different path 
clusters can be scheduled independently; 2. Within each path 
cluster, an agent only needs to be aware of its neighbors 
within a distance of 2 to take appropriate actions. These 
observations imply the possibility of a partially distributed 
planning algorithm that yields shortest total path length: 
Once agent-target assignment is done, it seems that global 
coordination is not required to schedule these agents. Since 
local communication is often more reliable and easy to 
implement, such a scheduling algorithm is more desirable in 
general. In this section, we provide a local communication 
protocol which leads to a distributed scheduling algorithm, 
again with a convergence time of n + £ — 1 . A common clock 
is assumed. Since the algorithm is essentially a distributed 
version of Algorithm [2] we omit the pseudocode. 

Assuming each agent is assigned a path, we will schedule 
them along these paths and possibly update their goals 
(targets) on the fly. Recall that with Property [5] we only 
need to worry about two agents occupying the same vertex 
at a given time step. This splits into two cases: 1. Two agents 
want to move to the same vertex in one time step, and 2. One 
agent moves to a vertex while another agent is staying there. 
We now give a communication protocol, including a forward 
communication phase and a backward communication phase 
at each time step, that handles both cases. 

Schedule 12 (Distributed Transfer Schedule) Repeat the 
following two communication phases until the desired forma- 
tion is complete. Forward communication phase. Assume 
that an agent a,- is located on v,- and wants to move to 
v, + i. Agent ai first checks whether v,- + i is occupied by some 
other agent aj and if it is, notifies aj of its intention and 
waits for aj 's response. At this point, aj will check whether 
it is already at its goal and if it is, switch its goal with 
a\ (aj will also redo its forward communication phase if 
it already did). If no agent is occupying aj, ai then looks 
for agents that also want to go to v,- + i. If there are, one 
agent is randomly picked to go to v I+ i in the next time step. 
Alternatively, we could deterministically pick an agent (e.g. 
based on identities of the vertices occupied by the agents). 
Other agents wanting to go to Vj+i then must wait one time 
step. Since we are dealing a finite number of agents and 
there are no cycles on a DAG, the forward communication 
will stop after at most 0(n) messages, each with a size of 
<9(logV). Backward communication phase. Next, an agent 
that has received requests from a following agent needs to 
respond back. Let such two adjacent agents be ai and aj, 
occupying vy v,+i, respectively, with ai wanting to go to 
V/+J. There are two subcases: 1. If aj will move, then it will 



notify that at it may go ahead and move to Vj+i. If aj gets 
multiple requests to occupy v,-+i then a randomly agent is 
selected to proceed (again, this can be made deterministic). 
2. If a i cannot move because another agent tells it so, then it 
simply relay that message backward. Clearly, the backward 
communication will stop after at most 0(n) messages, each 
with a size of O(logV). 

The distributed scheduling algorithm have a similar run- 
ning time compared with the centralized version. Using 
Theorem QT| and in particular the fact that at least i agents 
never need to wait more than i — 1 time steps, the following 
Proposition is immediate. 

Proposition 13 Schedule [72] transfers all agents to achieve 
the desired formation in 0(n + £ — I) time steps. 

The scheduling algorithm is fairly simple to implement, 
as we did in Java. For visualizing the result, a Java ap- 
plet, accessible onlin^H was created to illustrate the target 
assignment and scheduling process. For more details about 
the simulation, see the webpage given in the footnote. A 
snapshot of a running session is provided in Fig. [8] We do 
not provide computational evaluation here since the overall 
algorithm has similar running time as the algorithm from 
[22]. Readers interested in computational time on large 
instances may refer to that paper for more details. 




Fig. 8. A simulation capture. The red/blue circles and numbers are the 
start/goal locations (already assigned to have shortest total distance). The 
light blue solid discs represent the agents. The bold black lines are the paths 
yet to be completed. 



http://msl.cs.uiuc. edu/~ jyul 8 / pe/distr-f orm . html 
(a Java plugin of version 6 or higher is required). 



VI. Integer Edge Lengths and Capacities 

So far we have assumed that we work with a graph G 
with unit edge lengths and capacities. That is, an edge takes 
a unit of time to cross and can hold one agent at a time. We 
now relax this assumption to allow non-unit edge lengths and 
capacities. Formally, let d,c : E — > Z + be the edge length map 
and edge capacity map, respectively. We assume that for any 
e €E,d(e) > c(e), which is generally true for physical robots 
with non-negligible sizes (up to a multiplicative constant). 
The main goal of this section is to extend the results from 
previous sections under this setup. Note that the definition 
of scheduled paths and feasible paths from Section HI] need 
to be updated since it may take multiple time steps for an 
agent to cross an edge. Thus, a scheduled path p, becomes 
a partial map as it may be undefined for some time steps. 
We omit formal descriptions of these required updates since 
they are intuitive but lengthy to state. 

It is clear that Algorithm Q] is insensitive to edge length. 
Therefore, the algorithm again produces an unscheduled path 
set Q satisfying Properties |2]|5] Moreover, all results from 
Section||ll]continues to hold with edge lengths that are not all 
ones. On the other hand, scheduling the path set Q becomes 
slightly trickier, since depending on edge capacities, one or 
more agent may be on the same edge during within one 
time step. To simplify the analysis, we look at two extreme 
cases: 1. For all e e E,c(e) — d(e). 2. For all e £ E,c(e) = 1. 
The first case models scenarios that allow bumper to bumper 
road traffic. This case is easy to handle, due to the following 
observation: By subdividing each edge e EE into d(e) edges 
of unit length, we obtain a new graph G with unit edge length 
and capacity. We turn our attention to the second case, which 
models bottleneck edges such as a long and thin bridge. First 
we establish a lower bound. 

Lemma 14 Assume Ve G E,c(e) = 1 and let af max = 
maXeeE d(e). Then £+(n — l)d max time steps is necessary 
to schedule n agents along a shortest path set Q. 

Proof. In the instance of Problem Q] shown in Fig. [9] 
assume that all edges have the same length d; hence, 
<^max = d. The graph G is two stars with their centers 
connected by a single path; the red vertices form xj(A) and 
the blue ones xq(A). It is clear that all red vertices are of 
distance £ to all blue vertices. Given this problem instance, 
all agents must go through the path uv...xy sequentially. 
To optimize arrival time, the first agent (say ai) to reach 
goal must visit u at t = d max . Consequently, a\ cannot 
reach v earlier than t = 2<f max . This implies that no other 
agent can head to v from u before t — 2af max , due to the 
unit edge capacity constraint. Via simple induction, the 
last agent arriving at u cannot leave it before t = naf max . 
Therefore, it cannot arrive earlier than t =l+{n— 1 )af max . D 

If we pretend that all edges have the same length d m ax^ 
Algorithm |2] can be easily extended to schedule a shortest 
path set Q. Clearly, this provides an overestimate of the 
total time it takes to schedule Q. Since no agent is delayed 
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Fig. 9. An i nstance of Problem [T] for demonstrating the necessity claim 
of Lemma PhI 

more than (re — l)af ma x time steps, the following corollary to 
Theorem QT| is immediate. 

Corollary 15 Assume Ve G E,c{e) = 1 and let J m ax = 
max ee £c/(e). Algorithm^ schedules a shortest path set Q 
such that the scheduled path set requires at most t + (re — 
l)^max time steps to complete. 

Thus, the time bound I + (re — 1 )d maK is tight for the 
unit edge capacity case. Combining the two extreme cases 
together, we have the following conclusion. 

Theorem 16 For the extension of Problem Q] with integer 
edge lengths and capacities in which 1 < c{e) < d{e) for 
all e G E, the time bound £+ (n — l)c/ ma x is sufficient and 
necessary to schedule n agents along a shortest path set Q. 

Straightforward complexity analysis shows that for integer 
edge lengths and capacities, the running time of the entire 
algorithm becomes 0(nV 2 + nVd max ). 

VII. Conclusion and Future Work 

In this paper, for the multi-agent formation path planning 
problem on graphs, we showed the existence of a vertex 
ordering structure induced by the initial and goal formations, 
which in turn admits a simple and natural scheduling algo- 
rithm for coordinating the shortest paths amongst the indis- 
tinguishable agents with a tight convergence time guarantee. 
Furthermore, the ordering allows the scheduling algorithm 
to be distributed. We then showed that the ordering as well 
as the convergence time guarantee generalize to integer edge 
lengths and capacities. 

Seeing how the vertex ordering helped us in obtaining 
a distributed scheduling algorithm without sacrificing con- 
vergence time, we plan to study further implications of 
this order structure. On the practical side, we hope to put 
the algorithm onto robots to test its performance in real 
world applications. With increased availability of cheap and 
fast wireless communication capabilities, we believe our 
algorithm can be used on formation control problems for 
a large group of robots or other types of vehicles in practice . 
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